System and method for focussed web crawling

US Pat. 6418433 - Filed Jan 28, 1999 - International Business Machines Corporation 
Each Web page is electronically stored in a respective Web site on a computer,
... When a person wishes to retrieve information, the person's browser ... 

Web search engine with graphic snapshots 
US Pat. 6643641 - Filed Apr 27, 2000 

On the - A level of the browser, the user can opt to display ... Keywords intended 
to cause the page to be selected and to rate highly in particular ... 

Text indexing system to index, query the archive database document by keyword data
representing the content of the documents and by contact data associated with the participant
who generated the document

US Pat. 7082427 - Filed May 24, 2001 - ReachForce, Inc. 

The results of these queries are delivered as business charts (such as bar charts 

or pie charts) in a web browser environment to the end users. ... 

Autonomous citation indexing and literature browsing using citation context 
US Pat. 6289342 - Filed May 20, 1998 - NEC Research Institute, Inc. 
Document Acquisition and Database Initialization citeseer can be used to create 
... broad keywords to crawler module 12 using a web browser interface 18. ...

Autonomous citation indexing and literature browsing using citation context 
US Pat. 6738780 - Filed May 16, 2001 - NEC Laboratories America, Inc. 
Document Acquisition and Database Initialization citeseer can be used to ...
broad keywords to crawler module 12 using a web browser interface 18. ...

Information transmission, information display method and information display apparatus 

US Pat. 6891859 - Filed Aug 26, 2002 - Kunihiro Hyakutake 

Check of the Key's Valid Period When the browser is started up, ...

for accessing a corresponding page to a click operation to a web page button. ... 

XX XX XX 

US Pat. 7099490 - Sony Corporation 

... thus permitting more accurate identification of the route of acquisition of 
... URL is instructed by the user, instructs the Web browser 303 to access ... 
Patents 1 - 10 on indexing web page data browser new web pages indexing. (0.36 seconds)

System and method for creating a dynamic data file from collected and filtered web pages 

US Pat. 6449636 - Filed Sep 8, 1999 - Nortel Networks Limited 

Browser 150 displays the formatted results as a Web page 170 the Web spider 

for a search engine scans Web pages and returns for indexing purposes the ... 



Rules-based identification of items represented on web pages

US Pat. 7085736 - Filed Feb 27, 2001 - Alexa Internet 

In one embodiment, the web browser 106 executes the additional client application 
code embedded in a new client web page received from the data server 122. ... 

Data processing system and method for internet browser history generation 

US Pat. 6310630 - Filed Dec 12, 1997 - International Business Machines Corporation 
In an embodiment of the present web page is already known to the browser, ... 
Then functions, including the application program interface a new data record ... 



Method for parsing, indexing and searching world-wide-web pages 

US Pat. 6021409 - Filed Sep 8, 1998 - Digital Equipment Corporation 

If the page is gone, the browser 20 can inform the maintenance module 80. ... 

the data structures 1001 will have their locations expressed in "new" space, ... 



Method for parsing, indexing and searching world-wide-web pages

US Pat. 5864863 - Filed Aug 9, 1996 - Digital Equipment Corporation 

If the page is gone, the browser 20 can inform the maintenance module 80. ... 

Therefore, associated with each data structure 1001 is an "old/new" indication ... 



Sending to a central indexing site meta data or signatures from objects on a computer network 

US Pat. 6516337 - Filed Oct 14, 1999 - Arcessa, Inc. 

The indexing system 200 may also perform ranking of web pages having references 
... based upon whether a page is a source or reference to the desired data. ... 



Method for using agents to create a computer index corresponding to the contents of 
networked computers 

US Pat. 6976053 - Filed May 23, 2000 - Arcessa, Inc. 

Each time the spider visits a web page, the central index is updated so ... 

While the spider is busy indexing these new web pages, it cannot revisit old web ... 



Graphical search engine visual index 

US Pat. 6271840 - Filed Sep 24, 1998 

The data stream through web pages. For example, using javascript, ... link and 
a new window would conjunction with the browser 94, allows the data stream to ... 



Building a database of CCG values of web pages from extracted attributes 

US Pat. 6466940 - Filed Feb 11, 1998

As new words are found, the new word is added as a new row in the word column 
... The example indexing program opts to index all other CCG-data contained in ... 



Method for indexing duplicate database records using a full-record fingerprint 

US Pat. 5745900 - Filed Aug 9, 1996 - Digital Equipment Corporation 

The browser programs 112 allow data structures 71 . and summary data structures 

72-73. The retrieved. Typically, the address of a Web page is specified 20 ... 
Patents 11 - 20 on indexing web page data browser new web pages indexing. (0.19 seconds)

Sequential searching of a database index using constraints on word-location pairs 

US Pat. 5745890 - Filed Aug 9, 1996 - Digital Equipment Corporation 

Typically, the address of a Web page is specified 20 data structures 71, ... 

to add new entries while queries browser 20, a parsing module 30, an indexing ... 



Internet website traffic flow analysis using timestamp data 

US Pat. 6766370 - Filed Feb 8, 2002 - WebSideStory, Inc. 

The next field is a site visit count, which is incremented with each new page of 

the site that is requested by the browser during a current visit. ... 

Method for indexing duplicate records of information of a database 

US Pat. 5970497 - Filed Apr 27, 1998 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified as ... 

Method for promoting contextual information to display pages containing hyperlinks 

US Pat. 6763496 - Filed Mar 31, 1999 - Microsoft Corporation 

This step creates new meta-data entries for new documents, ... can be used will 

be apparent to those skilled in the art of web page design and development. ... 



Method for indexing duplicate records of information of a database 

US Pat. 6230158 - Filed Oct 19, 1999 - Altavista Company 

The browser programs 112 allow the users to enter addresses of specific Web pages
200 to be retrieved. Typically, the address of a Web page is specified ...



Method and system for annotating information resources in connection with browsing, in both 

connected and disconnected states 

US Pat. 6697838 - Filed May 3, 2000 - Software Leader, LLC 

The first Web browser 60 is being used to access and display Web page 50a ... 

note data 136a-d of the note files soa-d associated with the Web pages 50a-d ...



Retrieving, organizing, and utilizing networked data using databases 
US Pat. 6799174 - Filed Sep 17, 2001 - Science Applications International Corporation 
Once portal 201 has received new pages, a variety of methods exist for ... 
that data and the special protocol tags, and transfer that page 404 to web server ... 

Method for maintaining an index 

US Pat. 5765168 - Filed Aug 9, 1996 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified ...



System for adding new entry to web page table upon receiving web page including link to 
another web page not having corresponding entry in web page table 

US Pat. 5974455 - Filed Dec 13, 1995 - Digital Equipment Corporation 

A user accesses documents stored on the WWW using a Web browser (a computer ... 

for each Web page entry, a data file representing 30 million Web pages would ... 



Method for sampling a compressed index to create a summarized index 
US Pat. 5765158 - Filed Aug 9, 1996 - Digital Equipment Corporation 
The browser programs 112 allow compressed data structure 71 is a compression of
the word the users to enter addresses of specific Web pages 200 to be ... 
Patents 21 - 30 on indexing web page data browser new web pages indexing. (0.24 seconds)
Constrained searching of an index 

US Pat. 6105019 - Filed Jul 26, 1999 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified as ... 



Method for generating a compressed index of information of records of a database 

US Pat. 6016493 - Filed Apr 3, 1998 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified as ... 



Method for searching an index 

US Pat. 5966710 - Filed Sep 9, 1998 - Digital Equipment Corporation

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified ...

Method for searching an index 

US Pat. 5832500 - Filed Aug 9, 1996 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified as ... 



System for adding a new entry to a web page table upon receiving a web page including a link 
to another web page not having a corresponding entry in the web page table 

US Pat. 6032196 - Filed Aug 28, 1998 - Digital Equipment Corporation 

A user accesses documents stored on the WWW using a Web browser ... each Web page 

entry, a data file representing 30 million Web pages would occupy about 3 ... 

System and method for focussed web crawling

US Pat. 6418433 - Filed Jan 28, 1999 - International Business Machines Corporation 
Each Web page is electronically stored in a respective Web site on a computer,
... the Web, to index new pages and to periodically revisit old pages that ... 

Method for optimizing entries for searching an index 

US Pat. 6047286 - Filed Aug 21, 1998 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified ...



Method for optimizing entries for searching an index 

US Pat. 5852820 - Filed Aug 9, 1996 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified ...

Method for mapping an index of a database into an array of files 

US Pat. 5963954 - Filed Jul 28, 1998 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified ...



Peer-to-peer automated anonymous asynchronous file sharing 

US Pat. 6675205 - Filed Jul 20, 2001 - Arcessa, Inc. 

The indexing system 200 may also perform ranking of web pages having ... 

for example, the location of the word in the web page and the position of the word ... 
Patents 31 - 40 on indexing web page data browser new web pages indexing. (0.14 seconds)

Technique for deleting duplicate records referenced in an index of a database 
US Pat. 6745194 - Filed Aug 3, 2001 - Alta Vista Company 

The browser programs 112 allow the users to enter addresses of specific Web pages 
200 to be retrieved. Typically, the address of a Web page is specified ...

System, method, and medium for retrieving, organizing, and utilizing networked data 
US Pat. 6038668 - Filed Jul 22, 1998 - Science Applications International Corporation 
Tool suite 508 may also receive additional pages 515 from web server 504 or
pages 517 from browser 305. Data 516 and data 518 may follow similar supply ... 

System, method, and medium for retrieving, organizing, and utilizing networked data 
US Pat. 6292894 - Filed Nov 23, 1999 - Science Applications International Corporation 
Tool suite 508 may also receive additional pages 515 from web server 504 or
pages 517 from browser 305. Data 516 and data 518 may follow similar supply ... 



Method for mapping an index of a database into an array of files 

US Pat. 5787435 - Filed Aug 9, 1996 - Digital Equipment Corporation 

The browser programs 112 allow the users to enter addresses of specific Web pages 

200 to be retrieved. Typically, the address of a Web page is specified as ... 

Peer-to-peer automated anonymous asynchronous file sharing 

US Pat. 7032000 - Filed Oct 17, 2003 - Arcessa, Inc. 

In automated 60 web pages having references in the central index. ... the location 
of the word in the web page and frequency is greater than some threshold ... 



Technique for indexing information stored as a plurality of records 

US Pat. 5966703 - Filed Apr 3, 1998 - Digital Equipment Corporation 

For example, the bulk of the Web pages are expressed in the English language. 

... If the page is gone, the browser 20 can inform the maintenance module 80. ... 



Method for indexing information of a database 

US Pat. 5745899 - Filed Aug 9, 1996 - Digital Equipment Corporation 

If the page is gone, the browser 20 can inform structures 1001, the updating ... 

in a subsequent tier are bulk of the Web pages are expressed in the English ... 

Text in anchor tag of hyperlink adjustable according to context 

US Pat. 6735739 - Filed Jan 29, 1998 - International Business Machines Corporation 

A Web browser "parses" the HTML script in order to display the text in ... 

In such Web pages, it has been found useful to provide an index Web page where ... 



Method and apparatus for identifying spoof documents 

US Pat. 6442606 - Filed Aug 12, 1999 - Inktomi Corporation 

A Web page is the image that is displayed to a user when a particular HTML file

... the Internet to identify both new and updated documents for indexing. ... 



Information access system and method for archiving web pages
US Pat. 6625624 - Filed Dec 30, 1999 - AT&T Corp.

One for contents and another for indexing. Each through a web site for a set of
... such as the web function for scanning data cached on a browser's cache ... 
Patents 1 - 10 on new web page data. (0.41 seconds)



System for identifying new web pages of interest to a user 
US Pat. 5933827 - Filed Sep 25, 1996 - International Business Machines Corporation 
Consequently, the selected data web page will be sent directly to the client ... 
to each data web page and is periodically updated with new data web pages. ... 

Modifying a shared resource 

US Pat. 6917950 - Filed Jan 10, 2001 - Intel Corporation 

This new web page includes a merger of the modified data 120 and the current
data 122. A further explanation of merging data into this new web page is ... 

Display screen and window size related web page adaptation system 
US Pat. 6300947 - Filed Jul 6, 1998 - International Business Machines Corporation 
Web page data 800 consists of pages that are defined in a single URL/CGI file. 
... clicking a mouse on these links), new web pages are generated from web ... 

Updating of embedded links in World Wide Web source pages to have the new URLs of their 
linked ... 

US Pat. 6606653 - Filed Oct 7, 1999 - International Business Machines Corporation 

... WEB PAGES AND THE UPGRADE TO SAID ACTIVATED EMBEDDED LINKS TO ACCESS SAID NEW 
URLS PROVIDE SECURITY MEANS FOR LIMITING ACCESS OF SOURCE WEB PAGE TO DATA ... 

System and method for web browsing 

US Pat. 6313855 - Filed Feb 4, 2000 - Browse3d Corporation 

In operation, when user 110 selects a new web page, web browser searches the web 
page data associated with the new web page for any hyperlinks 240 included ... 

Printing system and printing method using the printing system 
US Pat. 7180616 - Filed Jan 26, 2001 - Fuji Xerox Co., Ltd. 
Thus, according to the first printing method, the Web page data are obtained by 
... in the Web page should be updated for displaying a new Web page or ...

Intelligent method, apparatus and computer program product for automated refreshing of 
internet ... 

US Pat. 6275858 - Filed Jan 4, 1999 - International Business Machines Corporation 
... said stored page data and for each said user selected internet web page, ... 
new page; checking for said user selected new page in local memory; ...

Method for using agents to create a computer index corresponding to the contents of 
networked ... 

US Pat. 6976053 - Filed May 23, 2000 - Arcessa, Inc. 

Each time the spider visits a web page, the central index is updated so that ... 

as more web pages are added, the spider must visits these new web pages and ... 



Internet website traffic flow analysis using timestamp data 



US Pat. 6766370 - Filed Feb 8, 2002 - WebSideStory, Inc. 

... with image source data for the graphical element of the requested page and 

... wherein the traffic path analysis data indicates a new web site visit has ... 



Integrated method for creating a refreshable Web Query 

US Pat. 6948134 - Filed Mar 27, 2001 - Microsoft Corporation 

The initial Web page may be any Web site that the user has set as their "home 

... tabular data to import into a New Web Query dialog box described above. ... 
Patents 1 - 10 on new url data collection. (0.38 seconds)



Clickstream data collection technique 

US Pat. 7003565 - Filed Apr 3, 2001 - International Business Machines Corporation 
After computing a new URL token for the local cookie jar 30 (Block 680) or ... 
As a result of the present invention, clickstream data collection may be ... 

Web-based collaborative data collection system 

US Pat. 7076521 - Filed Jun 15, 2001 - Vertical Computer Systems, Inc. 

Once the URL table is opened the poller steps through the entries in the URL

... Data type information would typically be used where a new table is ...

Shared-data environment in which each file has independent security properties 

US Pat. 5930801 - Filed Oct 30, 1997 - Xerox Corporation 

The object for a collection includes therein, among other metadata, a list of 

the handles of ... This new URL is then sent to the browser of the user 10, ... 

Data collection system and method for reducing latency 

US Pat. 6950845 - Filed Oct 23, 2001 - Amdocs (Israel) Ltd. 

The new enhancement procedure can then be automatically populated with ...

service, date/time, and URL If the NSP wants to collect session data for ... 

System for selectively requesting data from a server based on prior accepted requests ... 
US Pat. 6170016 - Filed Dec 9, 1998 - International Business Machines Corp. 
Accordingly, collection of the background URL contents by the automatic content 
collection part 164 is essentially inhibited by inhibiting issuance of a new ... 

Unified data acquisition system 

US Pat. 6681198 - Filed Aug 16, 2001 - VelQuest Corporation

The meta-data area will show the URL from the procedure and a key field. ... 

Alternatively, a data collection dialogue can be displayed showing appropriate ... 

Method and system for delivering data from a server object to a client object using a non ... 

US Pat. 6591305 - Filed Jun 30, 1998 - Sun Microsystems, Inc. 

The servlet 200 intercepts the URL request and returns the next unviewed 10 ... 

and providing for a data collection thread that retrieves the plurality of ... 

Method and system for managing distributed data 

US Pat. 6182111 - Filed May 15, 1998 - Hitachi, Ltd. 

The new URL flag is set with truth (928). The updating procedure of the communications 
... Consider a collection of urls frequently accessed consecutively, ... 

Integrating the telephone network and the internet web 

US Pat. 6430175 - Filed May 5, 1998 - Lucent Technologies Inc. 

The web server analyzes the data in the new URL, and determines whether the ... 

entitled "Forward Number Collection Form", contains a single form called " ... 



Interactive debugging system with debug data base system 

US Pat. 6938245 - Filed Oct 28, 1998 - Veritas Operating Corporation 

The results of the new query reflect the garbage collection. ... universal resource 

locator (URL) which specifies the server and in many cases attached data ... 